A Practical Approach to Variable Selection — A Comparison of Various Techniques
Abstract: Selecting a useful list of variables for consideration in a predictive model is a critical step in the modeling process and can result in better models. Sifting through and selecting from a long list of candidate variables can be onerous and ineffective, particularly with the increasingly wide variety of external factors now available from third-party providers. This paper explores a variety of variable selection techniques, applied to frequency and severity models of homeowner insurance claims, developed on a dataset with over 350 initial candidate variables. The techniques are evaluated using multiple criteria, including the predictive power of a resulting model (measured using out-of-sample data) and ease of use. A method based on Elastic Net performs well. Random selections perform as well as some more sophisticated methods, for sufficiently long shortlists.
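As a rough illustration of how an Elastic Net can produce a variable shortlist, the sketch below fits a penalized linear model by coordinate descent and keeps the variables with the largest non-zero coefficients. It is not the paper's implementation: it assumes a squared-error loss rather than the frequency and severity models described in the abstract, it uses synthetic data, and every name in it (`standardize`, `elastic_net`, `shortlist`, `lam`, `alpha`, `k`) is hypothetical.

```python
import random


def standardize(columns):
    """Center each column and scale it to unit (population) variance."""
    out = []
    for col in columns:
        n = len(col)
        mean = sum(col) / n
        var = sum((v - mean) ** 2 for v in col) / n
        sd = var ** 0.5 or 1.0  # constant columns become all zeros
        out.append([(v - mean) / sd for v in col])
    return out


def soft_threshold(z, gamma):
    """Shrink z toward zero by gamma (the L1 part of the update)."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0


def elastic_net(columns, y, lam, alpha, n_iter=200):
    """Coordinate descent for the objective
    (1/2n)*||y - Xb||^2 + lam*(alpha*||b||_1 + (1 - alpha)/2*||b||_2^2),
    assuming the columns are already standardized."""
    n, p = len(y), len(columns)
    y_mean = sum(y) / n
    resid = [v - y_mean for v in y]  # residual with all coefficients at 0
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            xj = columns[j]
            # Covariance of x_j with the residual that excludes x_j itself.
            rho = sum(xj[i] * resid[i] for i in range(n)) / n + beta[j]
            new = soft_threshold(rho, lam * alpha) / (1 + lam * (1 - alpha))
            delta = new - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= delta * xj[i]
                beta[j] = new
    return beta


def shortlist(names, beta, k):
    """Return up to k variable names with the largest non-zero |coefficient|."""
    ranked = sorted(zip(names, beta), key=lambda t: -abs(t[1]))
    return [name for name, b in ranked[:k] if b != 0.0]


# Synthetic example: only x0 and x1 drive the response; x2..x7 are noise.
random.seed(0)
n, p = 200, 8
columns = [[random.gauss(0, 1) for _ in range(n)] for _ in range(p)]
y = [3 * columns[0][i] - 2 * columns[1][i] + random.gauss(0, 1) for i in range(n)]
names = [f"x{j}" for j in range(p)]

beta = elastic_net(standardize(columns), y, lam=0.1, alpha=0.5)
selected = shortlist(names, beta, k=2)
print(selected)
```

In this sketch `alpha` balances the L1 and L2 penalties and `lam` sets the overall penalty strength; larger `lam` values zero out more coefficients and so yield shorter shortlists. In practice both would normally be tuned on held-out data, in line with the out-of-sample evaluation the abstract describes.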
Similar resources
A new quadratic deviation of fuzzy random variable and its application to portfolio optimization
The aim of this paper is to propose a convex risk measure in the framework of fuzzy random theory and verify its advantage over the conventional variance approach. For this purpose, this paper defines the quadratic deviation (QD) of fuzzy random variable as the mathematical expectation of QDs of fuzzy variables. As a result, the new risk criterion essentially describes the variation of a fuzzy ...
An Expert System for Intelligent Selection of Proper Particle Swarm Optimization Variants
Regarding the large number of developed Particle Swarm Optimization (PSO) algorithms and the various applications for which PSO has been used, selecting the most suitable variant of PSO for solving a particular optimization problem is a challenge for most researchers. In this paper, using a comprehensive survey and taxonomy on different types of PSO, an Expert System (ES) is designed to identif...
Extended MULTIMOORA method based on Shannon entropy weight for materials selection
Selection of appropriate material is a crucial step in engineering design and manufacturing process. Without a systematic technique, many useful engineering materials may be ignored for selection. The category of multiple attribute decision-making (MADM) methods is an effective set of structured techniques. Having uncomplicated assumptions and mathematics, the MULTIMOORA method as an MADM appro...
Interval MULTIMOORA method with target values of attributes based on interval distance and preference degree: biomaterials selection
A target-based MADM method covers beneficial and non-beneficial attributes besides target values for some attributes. Such techniques are considered as the comprehensive forms of MADM approaches. Target-based MADM methods can also be used in traditional decision-making problems in which beneficial and non-beneficial attributes only exist. In many practical selection problems, some attributes ha...
Finding stability regions for preserving efficiency classification of variable returns to scale technology in data envelopment analysis
This paper addresses issue of sensitivity of efficiency classification of variable returns to scale (VRS) technology for enhancing the credibility of data envelopment analysis (DEA) results in practical applications when an additional decision making unit (DMU) needs to be added to the set being considered. It also develops a structured approach to assisting practitioners in making an appropria...
Publication date: 2015